Differentially Private Chi-Squared Hypothesis Testing: Goodness of Fit and Independence Testing

نویسندگان

  • Marco Gaboardi
  • Hyun-Woo Lim
  • Ryan M. Rogers
  • Salil P. Vadhan
چکیده

Hypothesis testing is a useful statistical tool in determining whether a given model should be rejected based on a sample from the population. Sample data may contain sensitive information about individuals, such as medical information. Thus it is important to design statistical tests that guarantee the privacy of subjects in the data. In this work, we study hypothesis testing subject to differential privacy, specifically chi-squared tests for goodness of fit for multinomial data and independence between two categorical variables.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Class of Private Chi-Square Tests

In this paper, we develop new test statistics for private hypothesis testing. These statistics are designed specifically so that their asymptotic distributions, after accounting for noise added for privacy concerns, match the asymptotics of the classical (nonprivate) chi-square tests for testing if the multinomial data parameters lie in lower dimensional manifolds (examples include goodness of ...

متن کامل

Local Private Hypothesis Testing: Chi-Square Tests

The local model for differential privacy is emerging as the reference model for practical applications of collecting and sharing sensitive information while satisfying strong privacy guarantees. In the local model, there is no trusted entity which is allowed to have each individual’s raw data as is assumed in the traditional curator model for differential privacy. Individuals’ data are usually ...

متن کامل

Estimate-based goodness-of-fit test for large sparse multinomial distributions

The Pearson’s chi-squared statistic (X2) does not in general follow a chi-square distribution when it is used for goodness-of-fit testing for a multinomial distribution based on sparse contingency table data. We explore properties of Zelterman’s (1987) D2 statistic and compare them with those of X2 and we also compare these two statistics and the statistic (Lr) which is proposed by Maydeu-Oliva...

متن کامل

Quantum Chi-squared and Goodness of Fit Testing

A quantum mechanical hypothesis test is presented for the hypothesis that a certain setup produces a given quantum state. Although the classical and the quantum problem are very much related to each other, the quantum problem is much richer due to the additional optimization over the measurement basis. A goodness of fit test for i.i.d quantum states is developed and a max-min characterization f...

متن کامل

Goodness-of-Fit Tests for Random Partitions via Symmetric Polynomials

We consider goodness-of-fit tests with i.i.d. samples generated from a categorical distribution (p1, ..., pk). We test the null hypothesis whether pj = qπ(j) for some label permutation π. The uncertainty of label permutation implies that the null hypothesis is composite instead of being singular. In this paper, we construct a testing procedure using statistics that are defined as indefinite int...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016